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SEQUENCE LISTING 




(1) GENERAL INFORMATION: 

(i) APPLICANT: ASHIKARI, Toshihiko 
TANAKA, Yoshikazu 
FUJIWARA , Hiroyuki 
NAKAO , Ma s ahi r o 
FUKUI , YLiko 
SJ\KAKIBARA, Keiko 
MI2UTANI , Masako 
KUSUMI , Takaaki 

(ii) TITLE OF INVENTION: GENE ENCODING A PROTEIN HAVING ACYL 
GROUP TRANSFER ACTIVITY 

(iii) NUMBER OF SEQUENCES: 31 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BURNS, DOANE , ^SW^CKER & MATHIS, L.L.P. 

(B) STREET: P.O. Box 1404 / 

(C) CITY: Alexandria £^ 

(D) STATE: Virginia 

(E) COUNTRY: United^ates 

(F) ZIP: 22313-1404 

(v) COMPUTER READABLE FORM:/ 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS tADOS 

(D) SOFTWARE: Patent In Release/#1 . 0 , Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION^ NUMBER: US 08/8 94,3 56 

(B) FILING DATE: 18 -AUG- 1997 

(C) CLASSIFICATION: 




GAT 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 7-67159 

(B) FILING DATE: 17-FEB-1995 



/ 



(vii) PRIOR APPLICATION DATA: 

(A) APP^LICATION NUMBER: JP 7-196915 

(B) FILING DATE: 29-JUN-1995 

(vii) PRIOR^PPLICATION DATA: 

(A) /application NUMBER: JP 8-46534 

(B) FILING DATE: 30-JAN-1996 

(vii) PRIOR APPLICATION DATA: 

/(A) APPLICATION NUMBER: WO PCT/ JP96/00348 
(B) FILING DATE: 16-FEB-1996 

(viii)y ATTORNEY/ AGENT INFORMATION: 

/ 
/ 
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(A) NAME: Meuth, Donna M. 

(B) REGISTRATION NUMBER: 36,607 

(C) REFERENCE /DOCKET NUMBER: 001560-308 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 836-6620 

(B) TELEFAX: (703) 836-2021 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 03 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to thkNA 



(iii) HYPOTHETICAL: NO 
( iv) ANTI - SENSE : NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gentiana 
(F) TISSUE TYPE: petal 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA library 

(B) CLONE: pGAT4 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 6 . .X412 



flora var . japonica 



(xi) SEQUENCE DESCRIP/TION : SEQ ID NO : 1 : 



TCATT ATG GAG CAA ATC 
Met Glu Gin lie 
1 



ATG GTG AAG GTT CTT GAA AAA TGC CAA 
Met Val Lys Val Leu Glu Lys Cys Gin 
5 10 



47 



GTT ACA CCA CCA TCT GAC ACA ACA GAT GTC GAG TTA TCG CTA CCG GTA 
Val Thr Pro Pro Ser Afep Thr Thr Asp Val Glu Leu Ser Leu Pro Val 
15 /20 25 30 



95 



ACA TTC TTC GAT ATG CCC TGG TTG CAC TTG AAT AAG ATG CAG TCC CTT 



Thr Phe Phe Asp lie 
35 

CTG TTT TAC GAC TTT 
Leu Phe Tyr Asp Phe 
50 



?ro Trp Leu His Leu Asn Lys Met Gin Ser Leu 
40 45 

CG TAC CCA AGA ACA CAT TTC TTG GAC ACT GTT 
Pro Tyr Pro Arg Thr His Phe Leu Asp Thr Val 
55 60 



143 



191 



52 



ATC CCT AAT CTT AAG GCC TCT TTG TCT CTC ACT CTA AAA CAC TAG GTT 
lie Pro Asn Leu Lys Ala Ser Leu Ser Leu Thr Leu Lys His Tyr Val 
65 70 75 



239 



CCG CTT AGC 
Pro Leu Ser 
80 



GGA AAT 
Gly Asn 



TTG TTG 
Leu Leu 
85 



ATG CCG ATC AAA 
Met Pro lie Lys 



TCG GGC GAA 
Ser Gly Glu 
90 



ATG CCG 
Met Pro 



287 



AAG TTT CAG 
Lys Phe Gin 
95 



TAG TCC CGT GAT GAG GGC GAC TCG 
Tyr Ser Arg Asp Glu Gly Asp Ser 
100 105 



ATA ACT/TTG 
lie Thr Leu 



ATC GTT 
lie Val 
110 



335 



GCG GAG TCT 
Ala Glu Ser 



GAC CAG GAT TTT GAC TAC CTT AAA GGT CAT CAA CTG GTA 
Asp Gin Asp Phe Asp Tyr Leu Lys .Gly His Gin Leu Val 
115 120 / 125 



383 



GAT TCC AAT 
Asp Ser Asn 



AGG ACC ATG 
Arg Thr Met 
145 

ACC GTT TTT 
Thr Val Phe 
160 

TCA ATT GCA 
Ser lie Ala 
175 

TAT ATT AAC 
Tyr lie Asn 



CTT CCA TCT 
Leu Pro Ser 



GAA ACA TTT 
Glu Thr Phe 
225 

TTT GGA AGC 
Phe Gly Ser 
240 

CTC TCC CTT 
Leu Ser Leu 
255 

AGA GGA TCC 
Arg Gly Ser 



GAT TTG 

Asp Leu 
130 

CAA GAC 

Gin Asp 



CCT AAC 
Pro Asn 



GAT GCT 
Asp Ala 



AAA TTT 
Lys Phe 
195 

TTC GAT 
Phe Asp 
210 

TGG AAC 
Trp Asn 



AAA CCC 
Lys Pro 



CAT GGC 
His Gly 



TAT AAA 
Tyr Lys 



CGT GGC 
Arg Gly 
165 



CTT TTT TAT ^jGTT 
Leu Phe Ty/ Val 
135 



GTG ATG CCG CTC 
Val 11^ Pro Leu 
150 





TTT GTA ATG TTC 
Phe Val Met Phe 



185 



GAC GCG GAC TTG 
Asp Ala Asp Leu 
200 



TA ATC AAA GAT 
lie lie Lys Asp 
215 

CAA GAT GTT CTT 
Gin Asp Val Leu 
230 

TTC AAC KN^ GTA 
Phe Asn Lys Val 



^CA ATA 
He 



AAG CTA AAG AAC 
Lys Leu Lys Asn 
265 

CGT GTA ACG ACG 
Arg Val Thr Thr 
280 



ATG CCA CGG 
Met Pro Arg 
140 

GTA GCC GTG 
Val Ala Val 
155 

CTG ACG GCA 
Leu Thr Ala 
170 

ATC AAT GCT 
lie Asn Ala 



TTG TCC GCG 
Leu Ser Ala 



CTG TAT GGC 
Leu Tyr Gly 
220 

GAA ATG TTC 
Glu Met Phe 
235 

CGA GCT ACA 
Arg Ala Thr 
250 

AAA GTA CTG 
Lys Val Leu 



TTC ACA ATG 
Phe Thr Met 



GTT ATA 
Val He 



CAA GTA 
Gin Val 



CAT CAT 
His His 



TGG GCC 
Trp Ala 
190 

AAT CTT 
Asn Leu 
205 

CTA GAG 
Leu Glu 



TCT AGA 
Ser Arg 



TAT GTC 
Tyr Val 



AAT CTC 
Asn Leu 
270 

ACG TGT 
Thr Cys 
285 



431 



479 



527 



575 



623 



671 



719 



767 



815 



863 
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GGA TAG GTA TGG ACA TGC ATG GTC AAA TCA AAA GAT GAG GTC GTA/TCA 

Gly Tyr Val Trp Thr Cys Met Val Lys Ser Lys Asp Asp Val Va/ Ser 

290 295 300 

GAG GAA TCA TOG AAC GAG GAA AAT GAG GTC GAG TAG TTC AGT/TTT ACA 

Glu Glu Ser Ser Asn Asp Glu Asn Glu Leu Glu Tyr Phe Se^ Phe Thr 

305 310 315 

GCG GAT TGC CGA GGA CTT CTG ACG CCC CCG TGT CCG CCT/aAC TAC TTT 

Ala Asp Cys Arg Gly Leu Leu Thr Pro Pro Cys Pro Pr^ Asn Tyr Phe 

320 325 330 

GGC AAC TGT CTT GCG TCA TGC GTT GCA AAA GGA ACA/CAT AAA GAG TTA 

Gly Asn Cys Leu Ala Ser Cys Val Ala Lys Ala Th/ His Lys Glu Leu 

335 340 345 / 350 

GTT GGG GAT AAA GGG CTT CTT GTT GCA GTT GCa/gCT ATT GGA GAA GCC 

Val Gly Asp Lys Gly Leu Leu Val Ala Val Alaf Ala lie Gly Glu Ala 



355 



360 V 365 



ATT GAA AAG AGG TTG CAC AAC GAA AAA GJ 
lie Glu Lys Arg Leu His Asn Glu Lys 
370 375 



(GTT CTt GCA GAT GCA AAA 
Val Lgiti Ala Asp Ala Lys 
380 



ACT TGG TTA TCG GAA TCT AAT GGA ATC^ 
Thr Trp Leu Ser Glu Ser Asn Gly Hi 
385 390 



:CT TCP/I^^ AGA TTT CTC GGG 
Pro Se/ Lys Arg Phe Leu Gly 
395 



ATT ACC GGA TCG CCT AAG TTC GAT 
lie Thr Gly Ser Pro Lys Phe 
400 405 



CG TAT/GGT GTA GAT TTT GGA TGG 
Asp/ Ser Tyr/ Gly Val Asp Phe Gly Trp 
410 



GGA AAG CCT GCA AAA TTT GAC ATT ACC TCT GTT GAT TAT GCA GAA TTG 
Gly Lys Pro Ala Lys Phe Asp/lle Thr Ser Val Asp Tyr Ala Glu Leu 
415 420 / 425 430 

ATT TAT GTG ATT CAG TCC A^G GAT TTT GAA AAA GGT GTG GAG ATT GGA 
lie Tyr Val lie Gin Ser yArg Asp Phe Glu Lys Gly Val Glu lie Gly 
435 / 440 445 

GTA TCA TTG CCT AAG A^T CAT ATG GAT GCA TTT GCA AAA ATC TTT GAA 
Val Ser Leu Pro Lys /le His Met Asp Ala Phe Ala Lys lie Phe Glu 
450 / 455 460 

GAA GGC TTT TGC TG^T TTG TCA TAGTCTCTTT AATAGAACCA TATTTGCTGC 
Glu Gly Phe Cys ^er Leu Ser 
465 

AATAAAGTAC CAA^TCCTTT AGTAACACTA CACCAAACCC TACTTTCGAG GCGGGAACAC 
CACAACGAGG TTCAATCACT AGAAGGTTGT ACTTCATAAA TTCCAGAGGT CGAATATACA 
CCGTTGTCCT /tGAAAAGTT GAACCTCACA CCTGACATGG TGTTACGATA GGTATTGTAT 
AATGCCATT^ TATACTTCCA TAAAGTATCC TATGCAATAG AGAACATGTT ATGTGTTAAA 



911 

959 

1007 

1055 

1103 

1151 

1199 

1247 

1295 

1343 

1391 

1442 

1502 
1562 
1622 
1682 
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AAAAAAAAAA AAAAAAAAAA A 



1703 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gentiana tri^lora va. y 
(F) TISSUE TYPE: petal 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA library 

(B) CLONE: pGAT106 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 35.. 1471 



iica 



(xi) SEQUENCE DESCRIPTION: 
GAACCATTGA ATCCAATTAA TCTGATTT^ 



TAAG ATG GCA GGA AAT TCC GAG 
Met Ala Gly Asn Ser Glu 
1 5 
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GAT ATC 
Asp lie 



GTC GCC 
Val Ala 



ATC TCT 
lie Ser 
40 

TGT CCC 
Cys Pro 
55 

CTT GTT 
Leu Val 



AAA GTT 
Lys Val 
10 

GAG TTT 
Glu Phe 
25 

GAT GCA 
Asp Ala 



AAC TCT 
Asn Ser 



CTC AAA 
Leu Lysi 



CTT GAG AAA/TGC 
Leu Glu Ly/ Cys 



ACA GTC /CCA CTG 
Thr Va]/ Pro Leu 
30 

GAA OfAC CAT CTG 
Glu yHis His Leu 
45 

TTT ATC ATT 
lVs Phe lie lie 
60 

:AC TTT CTT CCG 
lis Phe Leu Pro 
75 



CGT GTT GCG CCA CCA CCG GAC GCC 10 0 

Arg Val Ala Pro Pro Pro Asp Ala 
15 20 

TCG TTT TTC GAC ATG CGA TGG TTG 14 8 

Ser Phe Phe Asp Met Arg Trp Leu 
35 

CAT TTC TAC AGA TTC CGC CAT CCT 196 
His Phe Tyr Arg Phe Arg His Pro 
50 

TCA TCC ATT AAA TCG TCC CTT TCC 2 44 

Ser Ser lie Lys Ser Ser Leu Ser 
65 70 

TTA GCC GGG AAT TTG ATT TGG CCG 2 92 

Leu Ala Gly Asn Leu lie Trp Pro 
80 85 



GTA GAT TCC TCC GAT AGA ATG CCG GAG TTG CGT TAG AAG AAA GGG GA 
Val Asp Ser Ser Asp Arg Met Pro Glu Leu Arg Tyr Lys Lys Gly A 
90 95 100 



TCC GTT TCT TTA ACA ATT GCA GAA 
Ser Val Ser Leu Thr lie Ala Glu 
105 110 



TCG AGC ATG GAT 
Ser Ser Met Asp 



TTT GAT TA' 
Phe Asp 
115 



GCC GGA 
Ala Gly 
120 



GAT CAT CAG AGG 
Asp His Gin Arg 



GAT TCT 
Asp Ser 
125 



TAT AAA 
Tyr Lys 



TTC AAC 
Phe Asn 
130 



GAT 
Asp L' 



CAG CTG 
Gin Leu 
135 

TTT GCT 
Phe Ala 



CGC AAT 
Arg Asn 



AAT AAA 
Asn Lys 



TTC CTT 
Phe Leu 
200 

GAT CCA 
Asp Pro 
215 

AAA TCT 
Lys Ser 



GAA GTT 
Glu Val 



AAG AAA 
Lys Lys 



TAT AAT 
Tyr Asn 
280 

TTG TCG 
Leu Ser 
295 



CCA GAA CCG ATT 
Pro Glu Pro lie 
140 

TTA CAG GTG ACG 
Leu Gin Val Thr 
155 

CTT CAT CAA GTT 
Leu His Gin Val 
170 

TTA TGG GTT TTG 
Leu Trp Val Leu 
185 

CCA CTT TCT TCT 
Pro Leu Ser Ser 



TTT CAT 
Phe His 



GTA ACC 
Val Thr 



GTG TTC 
Val Phe 



CTT GGT 
Leu Gly 



GTT GAC 
Val Asp 
190 



TCC GGC 
Ser Gly 



TCC AAC 
Ser Asn 
160 

GAT GCC 
Asp 
175/ 



GAC GAA 
Asp Glu 
145 

ACC GGTy 
Thr Gl; 



^AC AGA 
Asp Arg 
210 

AAT GAA 
Asn Glu 
225 




GTjy TTA 
Val Leu 



ATA TGC 
lie Cys 



CAG GGC 
Gin Gly 



CGA GCC 
Arg Ala 
250 



AAA TCA TTA GAC 
Lys Ser Leu Asp 
300 



AAA AAT 
Lys Asn 
270 

ACG GTG 
Thr Val 
285 

ACC GTC 
Thr Val 



CTA AAT 
Leu Asn 
240 

CAC CCT 
His Pro 
255 

CGC AAC 
Arg Asn 



CCA GCA 
Pro Ala 



ATT GAT 
lie Asp 



TTA ACC 
Leu Thr 



ACA TCT GCA CTG 
Thr Ser Ala Leu 
290 

GTA AGA GAG AAG 
Val Arg Glu Lys 
305 



TTT CTG 
Phe Leu 
180 

GAT TCA 
Asp Ser 
195 

TCT GTG 
Ser Val 



AGA AAA 
Arg Lys 



ATT TCT 
lie Ser 



ATC ATG 
lie Met 
260 

GGT AGT 
Gly Ser 
275 

ATC TGG 
lie Trp 



CCA CTT 
Pro Leu 
150 

ATT GGA 
lie Gly 
165 

CAT TTT 
His Phe 



TTA AAG 
Leu Lys 



GTG CAA 
Val Gin 



CTG CTC 
Leu Leu 
230 

AAA GAT 
Lys Asp 
245 

AAG CTC 
Lys Leu 



AGT AAT 
Ser Asn 



ACA TGC 
Thr Cys 



GTG GAA GAG GAT 
Val Glu Glu Asp 
310 
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AAA CAT GCA GCA AAC TTA TGT OCT TTC 

Lys His Ala Ala Asn Leu Cys Ala Phe 
315 

OCT CCG CCG ATA CCT CAA AAT TAG TTT 

Ala Pro Pro lie Pro Gin Asn Tyr Phe 

330 335 

ATG GTG GGA TCG ACT CAT GAG CAA CTT 
Met Val Gly Ser Thr His Glu Gin Leu 
345 350 



GTA GCT GCA ACC GCC 
Val Ala Ala Thr Ala 
360 

TAG GAA GGA ATT CTG 
Tyr Glu Gly lie Leu 
375 

TCT GCG GCA CCA AGG 
Ser Ala Ala Pro Arg 
395 

CGC AAT GTG CAT GAT 
Arg Asn Val His Asp 
410 

AAG CAT GAA TCT GTT 
Lys His Glu Ser Val 
425 

TCT CGG TCC AGA AGA 
Ser Arg Ser Arg Arg 
440 

CCT AAG AAT AGG ATG 
Pro Lys Asn Arg Met 
455 



ATC GGA 
lie Gly 
365 

AGA GGA 
Arg Gly 
380 

TCG ACG 
Ser Thr 



TTT GAT 
Phe Asp 



TCA ACT 
Ser Thr 



TTT AAA 
Phe Lys 
445, 

GAC G^A 
Asp Ala 
460 



GAT GCT 
Asp Ala 



GAT TGG 
Asp Trp 



CTC ATT 
Leu lie 



GCA GA^ 
Ala Aspy 
41S 



TTT GCC 
Phe Ala 



AAT AGT CTC CAT GTG 
Asn Ser Leu His Val 
475 

TGTATAAAAA AGACAGAC 
GGGAAGATGT ATCAAACTj 
AAAAAAAAAA A 



(2) INFORMATION 




ATC AAC TGC CGA CAA CGT ^TT 
lie Asn Cys Arg Gin Arg/phe 
320 3: 

GGA AAT TGC ATA GTG /fcCT TGT 
Gly Asn Cys lie Vay Pro Cys 

34 

GTA GGA AAT GAA yGGG TTG TCG 
Val Gly Asn Gli/ Gly Leu Ser 
2i 

ATC CAT AAG AgG TTA CAT GAC 
lie His Lys^ Arg Leu His Asp 
31/ 

ATA TCG /CCG CCC CGA TCA ACA 
lie Sei/ Pro Pro Arg Ser Thr 

390 

TT GGA TCC GCA CAA 
al Gly Ser Ala Gin 
405 

TGG GGA AAG CTT GAA 
y Trp Gly Lys Leu Glu 
420 

CA ACA CTA ATT TTG ATC 
Ala Thr Leu lie Leu lie 
435 

CTT GAG CTT GGC ATT TCT TTG 
Leu Glu Leu Gly lie Ser Leu 
450 

ACC ATT TTT ACG AAT TTC ATC 
Thr lie Phe Thr Asn Phe lie 
465 470 



CCT TTG 
Pro Leu 



TAAGAAAAAA GTGGTATCAA 



GTTATGATGC AACAAATGTT TTAGGAGATT ACAAATCCAT 
:A TCTCTCTATA TATATATATT CAATTGTTTT AAAAAAAAAA 



1012 

1060 

1108 

1156 

1204 

1252 

1300 

1348 

1396 

1444 

1491 

1551 
1611 
1622 



SEQ ID NO: 3 



(i) SEQUENCE CHARACTERISTICS: 

(A) L^GTH: 16 05 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Petunia hybrida 
(F) TISSUE TYPE: petal 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA library 

(B) CLONE: pPAT4 8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 67.. 1410 



(xi) SEQUENCE DESCRIPTION: SB<I ID N? 
TGTCGACGAA ATCCATTTCA TTTCCTCT^C TTTC7*TG 



TTATCC ATG GCA GGT GAA GTA GCA AAA GAA 
Met Ala Gly Glu Val Ala Lysyfeln 
1 5 

GTC CTG AAA AAA ACA AAC GTG AAA /CCA 
Val Leu Lys Lys Thr Asn Val Lys^ Pro His 
15 20 / / 

GAG TGT CAA TTG GTA ACA TTT fiAT CTT CCT 
Glu Cys Gin Leu Val Thr Phe/Asp Leu Pro 
35 / 40 

AAC CAA AAA TTT CTC ATC yXT AAA GGT GCT 
Asn Gin Lys Phe Leu lie yTyr Lys Gly Ala 
50 / 55 

GTG GAA AAA ATT AAA GPJT GGA CTG GCC TTA 
Val Glu Lys lie Lys ASp Gly Leu Ala Leu 
65 / 70 

CAA CTA GCT GGG AAA /CTT GGA AAA GAT GAA 
Gin Leu Ala Gly Lys/ Leu Gly Lys Asp Glu 
80 / 85 

GAA TAC GAC GAT GAfc ATG GAT GGT GTA GAG 
Glu Tyr Asp Asp Asp Met Asp Gly Val Glu 
95 / 100 



TT TTCTAATTTC GTCATCATTG 60 

GTT ACA AAA GTG AAA 108 
;iu Val Thr Lys Val Lys 
10 

AAA CCA CTA GGA AAA AAA 156 
Lys Pro Leu Gly Lys Lys 
25 30 

TAC CTA GCT TTC TAT TAC 2 04 

Tyr Leu Ala Phe Tyr Tyr 
45 

GAA AAC TTT GAC GAG ACG 252 
Glu Asn Phe Asp Glu Thr 
60 

GTA TTG GTG GAT TTC TAT 3 00 

Val Leu Val Asp Phe Tyr 
75 

GAA GGG GTT TTC AGG GTG 34 8 

Glu Gly Val Phe Arg Val 
90 

GTG ACA GTG GCT GTT GCA 3 96 

Val Thr Val Ala Val Ala 
105 110 



GAA GAG 
Glu Glu 



TTG CAG 
Leu Gin 



CAT CGC 
His Arg 



ACC ATG 
Thr Met 
160 

TGG CAC 
Trp His 
175 

ATT TCT 
lie Ser 



GTC AAG 
Val Lys 



TCA GCA 
Ser Ala 



GAA AGA 
Glu Arg 
240 

ACA GTC 
Thr Val 
255 



ATA GAA GTT 
lie Glu Val 
115 

GAC TTG ATT 
Asp Leu lie 
130 

CCT CTT CTA 
Pro Leu Leu 
145 

GGA TTA GCA 
Gly Leu Ala 



TTT ATG ACC 
Phe Met Thr 



GCA GAT 
Ala Asp 



CCT TGT 
Pro Cys 



GCT GTG 
Ala Val 



GTC CCA CCA 
Val Pro Pro 
195 

CTC AAC CTC 
Leu Asn Leu 
210 

ACC AAC GGT 
Thr Asn Gly 
225 

GTA TTC AAG 
Val Phe Lys 



AAT GCC AAC 
Asn Ala Asn 



TTT AAC 
Phe Asn 
165 

TCG TGG 
Ser Trp 
180 

TTC CTT 
Phe Leu 



CTT ACT 
Leu Thr 



AAT AAA 
Asn Lys 
135 

CAG CTC 
Gin Leu 
150 

CAT GCT 
His Ala 



GAT GAA 
Asp Glu 
120 

ATC TTG 
lie Leu 



ACC AAG 
Thr Lys 



GTG CTG 
Val Leu 



GAA GGC 
Glu Gly 



AAT TTG 
Asn Leu 



ACC ACC 
Thr ThryLys 
125 




TCC GAG 
Ser Glu 



GAA CGA 
Glu Arg 



TCT CAA 
Ser Gin 



GAT GTC 
Asp Val 



TTC TC 
Phe S 



TCA/ GGA 
Ser Gly 
260 



GAG ACG 
Glu Thr 



:CC GAA 
^Pro Glu 



GAC CCA 
Asp Pro 
235 

GAC AAA 
Asp Lys 
250 

TCC ACA 
Ser Thr 



;AC GGG CTC 
Asp Gly Leu 



ACT TCG ACG 
Thr Ser Thr 



TCC ACC TCA 
Ser Thr Ser 
190 

AAC ACT CGA 
Asn Thr Arg 
205 

CAT GCT AAG 
His Ala Lys 
220 

CCT CTT CGC 
Pro Leu Arg 



ATC AAG TCA 
lie Lys Ser 



TTC CAA TCA 
Phe Gin Ser 
270 



444 



492 



540 



588 



636 



684 



732 



780 



828 



876 



CTC TCC 
Leu Ser 



CCC GAG 
Pro Glu 



GTT GAT 
Val Asp 



GCA CAC GTG 
Ala His val/ 
27/5 



2^5 



GAC TAC ACT 
Asp Tyr /Thr 
290'' 

/ 

CCT CCA ATG 
Pro Pro Met 
305 ' 



/TGG CTA 
Trp Leu 



GTG TAC 
Val Tyr 



CCA GAA 
Pro Glu 



GCC GTC 
Ala val 



ACT GTG 
Thr Val 
295 

AGT TAC 
Ser Tyr 
310 



ACA CGT 
Thr Arg 
280 

TTT GCT 
Phe Ala 



TTC GGC 
Phe Gly 



GCG CGC 
Ala Arg 



GAT TGC 
Asp Cys 



AAC CTA 
Asn Leu 
315 



CAA CTC AAG 924 
Gin Leu Lys 
285 

AGG AAA AGG 972 

Arg Lys Arg 

300 

ATT CAG GCA 102 0 

lie Gin Ala 



ATT TTC 
lie Phe 
320 



ACA GTG ACC 
Thr Val Thr 



GCG GCA 
Ala Ala 
325 



GGT TTG 
Gly Leu 



TTA CTA 
Leu Leu 



GCA AGC 
Ala Ser 
330 



CCG ATC GAG 
Pro lie Glu 



1068 



TTC GCT 
Phe Ala 
335 

GCC ATT 
Ala lie 



GAG TAG 
Gin Tyr 



AGG TTC 
Arg Phe 



GTG AGG 
Val Arg 
400 

CAA GGC 
Gin Gly 
415 

GCA AAT 
Ala Asn 



ACT GCT 
Thr Ala 



GGT GGG ATG ATA CAA CAA 
Gly Gly Met lie Gin Gin 
340 

GAT GAA AGA AAC AAG GAG 
Asp Glu Arg Asn Lys Glu 
355 

AAA GAT GCT GGA GTG AAC 
Lys Asp Ala Gly Val Asn 
370 

AAG GTT TAG GAC GTG GAT 
Lys Val Tyr Asp Val Asp 
385 390 

AGT GGT TCG AAC AAT AGG 
Ser Gly Ser Asn Asn Arg 
405 



GCG ATC 
Ala lie 



TGG GAG 
Trp Glu 
360 

TGT GTT 
Cys Val 
375 

TTT GGA 
Phe Gly 



TTT GAT 
Phe Asp 



GTG AAG CAT 
Val Lys His 
345 

AGC AAC CCG 
Ser Asn Pro 



GCT GTT GC 
Ala Val G^ 



GG^ ATG GTG 
gVy Met Val 
410 



GAC GOT AAG 
Asp i^aa Lys 
350 

ATC TTT 
s lie Phe 
365 

AGT TCG CCA 
Ser Ser Pro 
380 

CCA GAG AGT 
Pro Glu Ser 



TAT TTG TAG 
Tyr Leu Tyr 



(2) INFORMATION FOR 
(i) SEQUENCE CI 




AAA AAT GGA GGA AGA AGC 
Lys Asn Gly Gly Arg Se 
420 

GCT ATG GAG AGG TTG GAG 
Ala Met Glu Arg Leu Glu 
435 

TAATTTGCTT AGCTTGGACT G 



[AG ATT 
lie 



GAG TTC 
s Glu Phe 



AGT TTG GAA 
Ser Leu Glu 
430 

CTC ATG GAA 
Leu Met Glu 
445 



CACTTTATTT ATGAGCTGCT 



ATGACTCACA TGCATGTATG TTTATT^TTTT TTGGAGGGGT TCTTTCCTTT TATTGTTTTC 
TATGTTTTTT CTTTCTTGTA CGTT^TGAAG AGAAACCGAG TATAAAGGAA TAATGTTTTC 
AGTTATTAAA AAAAAAAAAA 



ID NO : 4 : 
\CTERISTICS 



(A) length/ 14 7 9 base pairs 

(B) TYPE : /nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPo/iOGY: linear 

(ii) molecule/ TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



1116 

1164 

1212 

1260 

1308 

1356 

1404 

1460 

1520 
1580 
1605 



(vi) ORIOINAL SOURCE: 

{Ay ORGANISM: Perilla ocimoides 



60 



(F) TISSUE TYPE: leaf 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: CDNA library 

( B ) CLONE : pS AT2 0 8 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3.. 1340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



CC GTG ATC GAA ACQ TGT AGA GTT GGG CCG CCG CCQ GAC TCG GTG GCG 
Val lie Glu Thr Cys Arg Val Gly Pro Pro P^o Asp Ser Val Ala 
1 5 10 / 15 



47 



GAG CAA TCG GTG CCG CTC ACA TTC TTC GAC 
Glu Gin Ser Val Pro Leu Thr Phe Phe Asp 
20 



7G ACG TGG CTG CAT TTT 
Trp Leu His Phe 
30 



95 



CAT CCC ATG CTT CAG CTC CTC TTC TAC 
His Pro Met Leu Gin Leu Leu Phe Ty^ 
35 



TTC CICT TGT TCC AAG CAA 
ro Cys Ser Lys Gin 
45 



Lu Phe 



143 



CAT TTT TCA GAA TCC ATC GTT CCA AA^V CTC 
His Phe Ser Glu Ser lie Val Pro lyys Leu 
50 55 



CAA TCT CTC TCT AAA 
Gin Ser Leu Ser Lys 
60 



191 



ACT CTC ATA CAC TTC TTC CCT CTj 
Thr Leu lie His Phe Phe Pro 
65 70 



TCA TGTC AAT TTA ATC TAC CCT TCA 
u Ser Cys Asn Leu lie Tyr Pro Ser 
75 



239 



TCC CCG GAG AAA ATG CCG GAO" TTT CGG TAT CTA TCC GGG GAC TCG GTT 
Ser Pro Glu Lys Met Pro Gl/U Phe Arg Tyr Leu Ser Gly Asp Ser Val 
80 85 / 90 95 



287 



TCT TTC ACC ATC GCA GAA/ TCT AGC GAC GAC TTC GAT GAT CTC GTC GGA 
Ser Phe Thr lie Ala Gl\/ Ser Ser Asp Asp Phe Asp Asp Leu Val Gly 
100 / 105 110 



335 



AAT CGT CCA GAA TCT yCCC GTT AGG CTC TAC AAC TTT GTC CCT AAA TTG 
Asn Arg Pro Glu Ser/pro Val Arg Leu Tyr Asn Phe Val Pro Lys Leu 
115 / 120 125 



383 



CCG CCC ATT GTC Q^A GAA TCC GAT AGA AAA CTC TTC CAA GTT TTC GCC 
Pro Pro lie Val Slu Glu Ser Asp Arg Lys Leu Phe Gin Val Phe Ala 
130 / 135 140 



431 



GTG CAG GTG ACT CTT TTC CCA GGC CGA GGC GTC GGT ATT GGA ATA GCA 
Val Gin Val TMr Leu Phe Pro Gly Arg Gly Val Gly lie Gly lie Ala 
145 / 150 155 



479 



ACG CAT CAC /ACC GTT AGC GAC GCC CCG TCG TTT CTC GCG TTT ATA ACG 
Thr His His/Thr Val Ser Asp Ala Pro Ser Phe Leu Ala Phe lie Thr 



527 



- 61 



160 

GCT TGG TCT 
Ala Trp Ser 



GAA TTT AAA 
Glu Phe Lys 



ACG AAA TTT 
Thr Lys Phe 
210 

CAA TCT CGT 
Gin Ser Arg 
225 

GTT TTC ACC 
Val Phe Thr 
240 

AGA GTT CCA 
Arg Val Pro 



TAT ATG TGG 
Tyr Met Trp 



165 

TCA ATG AGC AAA 
Ser Met Ser Lys 
180 

TCT TTG CCA GTT 
Ser Leu Pro Val 
195 

GAC TCC ATT TAT 
Asp Ser lie Tyr 



170 



CAT CCC 
His Pro 



CAA TCC 
Gin Ser 



GAC AAC GAG 
Asp Asn Glu 
290 

TTA GAT CCG 
Leu Asp Pro 
305 

GCG CTG CCG 
Ala Leu Pro 
320 

TTT CTG GCA 
Phe Leu Ala 



GAC AAG AGA 
Asp Lys Arg 



AAA GCG TTG 
Lys Ala Leu 
370 



AGT TTA 
Ser Leu 
260 

GCT GGC 
Ala Gly 
275 

GAT GCA 
Asp Ala 



TCA TTA 
Ser Leu 
230 

AAA ATT 
Lys lie 
245 

GTC CAT 
Val His 



CAC ATT 
His lie 



TTC GAT 
Phe Asp 
200 

TGG AGA 
Trp Arg 
215 

CCG ACG 
Pro Thr 



AAG AAA 
Lys Lys 



CTC TCA 
Leu Ser , 



GAA AAT GAA GAT 
Glu Asn Glu Asp 
185 



GAA GATT GAA 
Glu App Glu 



AAC GCG 
Asn Ala 



GAC CGC 
Asp Arg 



ATA ACG 
lie Thr 



TTT TTC 
Phe Phe 



CCG GTT 
Pro Val 



TTC GGG 
Phe Gly 



A GAT 
la Asp 



GAT CTA 
Asp Leu 
300 

AAC TGC 
Asn Cys 
315 





AGA TCC GTC ATA 
Arg Ser Val lie 



CTA 
Leu pys 
220 



TAT CCG 
Tyr Pro 



TTT CCT TTG 
Phe Pro Leu 



ACC ACG TTC 
Thr Thr Phe 



ATT CAG TCC 
lie Gin Ser 
255 

ATT GCA GCT 
lie Ala Ala 
270 

GAA GAC CAA 
Glu Asp Gin 
285 

AGG CCA CGA 
Arg Pro Arg 



TTA TCG TAC 
Leu Ser Tyr 



AGA ATG 
Arg Met 



GAA ACG 
Glu Thr 



TCA TAT 
Ser Tyr 



GCG GCG 
Ala Ala 



GTG GAG 
Val Glu 
360 

TTT TCG 
Phe Ser 
375 



CTG GTG 
Leu Val 
330 

GAG ATA 
Glu He 
345 

AAA TGG 
Lys Trp 



GGA GAG 
Gly Glu 



AAA AAA 
Lys Lys 



TCG CCG 
Ser Pro 



GTG GCA GGA TCG 
Val Ala Gly Ser 
380 



AAA GGG GTG 
Lys Gly Val 
335 

AGG ATC AAC 
Arg He Asn 
350 

GAG ATT CGT 
Glu He Arg 
365 

AGC AAG CTA 
Ser Lys Leu 



575 



623 



671 



719 



767 



815 



863 



911 



959 



1007 



1055 



1103 



1151 



- 62 - 



GAT CTT TAC GGT GCA GAT TTT GGA TGG GGG AAG 
Asp Leu Tyr Gly Ala Asp Phe Gly Trp Gly Lys 
385 390 

ATA TTG TCG ATT GAT GGG GAG AAA TAT GCA ATG 
lie Leu Ser lie Asp Gly Glu Lys Tyr Ala Met 
400 405 410 

AGG GAT TTC GAA GGA GGA TTG GAG GTT TGC TTG 
Arg Asp Phe Glu Gly Gly Leu Glu Val Cys Leu 
420 425 

AAA ATG GAT GCT TTT GCT GCT TAT TTT TCA CTG 
Lys Met Asp Ala Phe Ala Ala Tyr Phe Ser Leu 
435 440 

TAATAAATGT ATGTAATTAA ACTAATATTA TTATGTAAC^ 

CGTGAAGAAT AATCCCTATT ATATATTTAT GATTTGGT;^ 

TGAAAAAAAA AAAAAAAAA 



GCG AGA AAG CAJf GAA 
Ala Arg Lys G/n Glu 
395 

ACG CTT TG;^ AAA GCC 
Thr Leu C^s Lys Ala 
415 

TCT TTCS CCT AAG GAC 
Ser L^u Pro Lys Asp 
430 

GG^ ATT AAT GGT 
G]/y lie Asn Gly 
445 

'^ATTAATTAAG TGTTGAGTAA 
AAATAAAGTG TAAAGCCTCT 



1199 

1247 

1295 

1340 

1400 
1460 
1479 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1508 base pc 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilb^ 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA \fo mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: ySenecio cruentus 
(F) TISSUE TY^E: petal 

(vii) IMMEDIATE SOUfeCE : 

(A) LIBRAR'!^ cDNA library 

(B) CLONE 

(ix) FEATURE: 

(A) name/key : CDS 

(B) LOCATTION: 3.. 1364 



(xi) SEQUENC^ DESCRIPTION: SEQ ID NO : 5 : 

TG AAC ATT CTC iGAA CAT GCC CGA ATA TCG GCC CCC TCG GGC ACC ATC 

Asn lie Leu/Glu His Ala Arg lie Ser Ala Pro Ser Gly Thr lie 

1/5 10 15 



47 



63 



GGC CAT CGC 
Gly His Arg 



TCG TTA TCT CTT ACT TTC TTC GAC ATT ACT 
Ser Leu Ser Leu Thr Phe Phe Asp lie Thr 
20 25 



TGG CTA CTC 
Trp Leu ^eu 
3( 



95 



TTC CCT CCG 
Phe Pro Pro 



GTC CAC 
Val His 
35 



CAT CTT 
His Leu 



TTC TTC 
Phe Phe 
40 



TAT GAC TTT CCA 
Tyr Asp Phe Pro 



CAT TCT AAA 
His/Ser Lys 
45 



143 



TCC CAT TTC ATG GAC ACT ATT GTT CCC AGG CTA AAA CAA 
Ser His Phe Met Asp Thr lie Val Pro Arg Leu Lys Gln^ 
50 55 6 



rCT TTA TCG 
^Ser Leu Ser 



191 



GTC ACT CTT 
Val Thr Leu 
65 



CAA CAT TTT TTC CCG TTT GCT AGT AAT TTG 
Gin His Phe Phe Pro Phe Ala Ser Asn/Leu 
70 7 



ATT GTA TTT 
lie Val Phe 



239 



CCT AAC ACT 
Pro Asn Thr 
80 

GTT GAA GGT 
Val Glu Gly 



TTT AAT AAT 
Phe Asn Asn 



CCA CTT GTA 
Pro Leu Val 
130 

ACG GTC CCA 
Thr Val Pro 
145 

ATA TCA CTA 
He Ser Leu 
160 



GAT GGT 
Asp Gly 



GAT TCT 
Asp Ser 
100 

TTG ACA 
Leu Thr 
115 

CCT TCA 
Pro Ser 



CTT TTT 
Leu Phe 



GGA ATG 
Gly Met 



TCG GGT 
Ser Gly 
85 

GTT GTG 
Val Val 



GGA AAT 
Gly Asn 



TTG GGA 
Leu Gly 



TCA CTT 
Ser Leu 
150 

ACG AA* 
Thr A4n 



TTT AAT 
Phe Asn 



GTT ACT 
Val Thr 



CAT CCT 
His Pro 
12 



AAA AAA ^CA GAA 
Lys Lys/Pro Glu 

9: 




CAT CAT 
His His 



AGC CTT 
Ser Leu 
170 



GGT GAC 
Gly Asp 



ATA AAA CAC 
He Lys His 
95 

TGT CTT GAC 
Cys Leu Asp 
110 

AAC TTT TAT 
Asn Phe Tyr 
125 

GAT TGC GTC 
Asp Cys Val 



GGC TCG GGT 
Gly Ser Gly 



GCT AGC ACG 
Ala Ser Thr 
175 



287 



335 



383 



431 



479 



527 



CGG TTC AAC 
Arg Phe Asn 



TTT TTG 
Phe Leu 
180 



TGG ACT 
Trp Thr 



TCG ATT 
Ser He 
185 



ATT CAA 
He Gin 



TCT GGT GTA 
Ser Gly Val 
190 



575 



GAT CGG TCT 
Asp Arg Ser 



TTT 
Phe Le\ 
195 



^ACG AAA 
Thr Lys 



GGA TCT 
Gly Ser 
200 



CCA CCG 
Pro Pro 



GTT TTT 
Val Phe 



GAT AGA TTG 
Asp Arg Leu 
205 



623 



ATT AAC ATC 
He Asn He 
210 



CCA OAT 
Pro His 



TTA GAT 
Leu Asp 



GAA AAT 
Glu Asn 
215 



AAG TTG AGA CAT 
Lys Leu Arg His 
220 



ACA AGG CTC 
Thr Arg Leu 



671 



GAA AGT TTT TAt AAA CCT TCG AGC CTT GTT GGT CCC ACT 
Glu Ser Phe TVr Lys Pro Ser Ser Leu Val Gly Pro Thr 
225 7 230 235 



GAT AAA GTT 
Asp Lys Val 



719 



- 64 - 



CGG 
Arg 
240 



TCA ACG TTT GTG 
Ser Thr Phe Val 



TTG ACC CGA ACT AAT ATC AAT CTA CTA AAGT AAA 
Leu Thr Arg Thr Asn lie Asn Leu Leu Is^s Lys 
245 250 / 255 



767 



AAG GTC TTA ACC CAA GTG CCA AAC TTG GAG TAG ATG TCA TOT TTT ACG 
Lys Val Leu Thr Gin Val Pro Asn Leu Glu Tyr Met Ser afer Phe Thr 
260 265 / 270 



815 



GTA ACT TGT GGT TAT ATA TGG AGT TGC ATA GCG AAA TCA CTC GTA AAA 
Val Thr Cys Gly Tyr lie Trp Ser Cys lie Ala Lys skx Leu Val Lys 
275 280 / 285 



863 



ATA GGA GAA AGA AAG GGC GAA GAC GAG TTA GAA CACS TTC ATA ATC ACC 
lie Gly Glu Arg Lys Gly Glu Asp Glu Leu Glu GYn Phe lie lie Thr 
290 295 / 300 



911 



ATT GAT TGT CGA TCT CGT CTT GAT CCA CCA ATy CCC ACA GCC TAC TTT 
lie Asp Cys Arg Ser Arg Leu Asp Pro Pro ILfe Pro Thr Ala Tyr Phe 
305 310 / 315 



959 



\ 



GGT 
Gly 
320 

ACT 
Thr 



ATA 
He 



AAC TGT 
Asn Cys 



ACG GAA 
Thr Glu 



TGC AAA 
Cys Lys 



AGA 
Arg 



TGG CAT 
Trp His 
370 



GGT GCA 
Gly Ala- 



AAT GGG 
Asn Gly 
340 

ATG ATA 
Met He 
355 

GAA CCT 
Glu Pro 



CCA TGT 
Pro Cys 
325 

TAT GCA 
Tyr Ala 



TAT AAT 
Tyr Asn 



TTC ATG 
Phe Met 



GTC CCG ACCy 
Val Pro Th? 



VT GTC GTT TTG 
sn Val Val Leu 
335 



CTT 


GGT 


yGCT 


AAAl 


iGTAy 


^ATT 


GGA 


GAG 


TCT 


Leu 




[Ala 


Lys 


Wl' 


He 


Gly 


Glu 


Ser 






345 










350 




AAG 


GAC 


GGA 


ATC 


/TTG 


AAA 


GAT 


GCC 


GCG 


Lys 


Asp 


Gly 


"/ 


Leu 


Lys 


Asp 


Ala 


Ala 




^360 










365 






ATC 


CCG 


GCT 


AGG 


AAG 


ATT 


GGT 


GTT 


GCT 


lae 


Pro 


Ala 


Arg 


Lys 


He 


Gly 


Val 


Ala 


y375 










380 









1007 



1055 



1103 



1151 



GGT ACA CCT AAG CTC AAC TT^ TAC GAC TTT GAT TTT GGG TGG GGG AAG 
Gly Thr Pro Lys Leu Asn L^u Tyr Asp Phe Asp Phe Gly Trp Gly Lys 
385 3^0 395 



1199 



CGC 
Arg 
400 



ATA AAG 
He Lys 



TAT GAG 
Tyr Glu 



TCA ATA GAC TAT AAT ACG TCG ATT TCT 
Ser He Asp Tyr Asn Thr Ser He Ser 
410 415 



1247 



ATA AAT GCA AGC AAA ACA TCA 
He Asn Ala Ser Lys /Thr Ser 
420j 



GCA CAA GAT CTT GAA ATT GGA TTG AGT 
Ala Gin Asp Leu Glu He Gly Leu Ser 
425 430 



1295 



CTA CCG AGT ATG CA5V ATG GAG 
Leu Pro Ser Met Gin Met Glu 
435 



GCG TTT TCT AGC ATC TTT GAT GAA GGA 
Ala Phe Ser Ser He Phe Asp Glu Gly 
440 445 



1343 



TTA GAG AGT CAA /GTT TCA TTG 
Leu Glu Ser Gin' Val Ser Leu 
450 



TAGATCATCG TCCCCTTTTT GTGTGCATCA 



1394 



- 65 - 



AGTTTCTGTC GTTTTTATGA GTTGCCACTG TTCTATTCTT TAAGTATACC TTTCQJVCTAT 14 54 

GTTTTGAAGA TGCAACGATA TAAAATGAAA AAAAAAAAAA AAAAAAAAAA AAAA 1508 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D ) TOPOIiOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to ruRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lavandula anguj 
(F) TISSUE TYPE:- petal 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA library 

(B) CLONE: pLAT21 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3., 135 

(xi) SEQUENCE 



-TG ACC ACC CTC CTC 
Xxx Thr Thr Leu Leu 
1 

GTG GCT GAG CAG TCA 
Val Ala Glu Gin Ser 
20 

CAT TTC CAC CCC ATG 
His Phe His Pro Met 
35 

AAA CCC GCC TTC C 
Lys Pro Ala Phe Lj^u 
50 

TCT CTA ACC CTC /AAA 
Ser Leu Thr Lei/ Lys 
65 




CAC TTC 
His Phe 
70 



TCC CGA GTG GCG CCG CCT CCA GGC ACG 
Ser Arg Val Ala Pro Pro Pro Gly Thr 
10 15 

CTC ACC TTC TTC GAC ATG ACG TGG CTG 
Leu Thr Phe Phe Asp Met Thr Trp Leu 
25 30 

CTT CTC TTC TAC GAA CTC CCC TGT TCC 
Leu Leu Phe Tyr Glu Leu Pro Cys Ser 
40 45 

GTC GTT CCG AAA CTC AAA CAA TCC TTA 
Val Val Pro Lys Leu Lys Gin Ser Leu 
55 60 

TTC CCC CTT TCA TGC AAT CTA ATC TAC 
Phe Pro Leu Ser Cys Asn Leu lie Tyr 
75 



47 



95 



143 



191 



239 



- 66 - 



CCT CTA TCG 
Pro Leu Ser 
80 



CCG GAG 
Pro Glu 



AAA ATG CCG 
Lys Met Pro 
85 



GAG TTC 
Glu Phe 



CGG TAT CAG 
Arg Tyr Gin 
90 



AAC GGT GAC 
Asn Gly y^sp 
95 



287 



TCG GTT TCT 
Ser Val Ser 



TTC ACG 
Phe Thr 
100 



ATT ATG GAG 
lie Met Glu 



TCT GTC 
Ser Val 
105 



GGA GAT CAT 
Gly Asp His 



CCG <2AT TCC 
Pro/His Ser 
110 



335 



GCT CAT AAA 
Ala His Lys 



TAC TAC 
Tyr Tyr 
115 



TGC TTT GCC 
Cys Phe Ala 



CCT AGC 
Pro Ser 
120 



GAC GAT TAT 
Asp Asp Tyry 



5AA GAT CTC 
'^Glu Asp Leu 
125 



383 



CAG CTG CCG. 
Gin Leu Pro 
130 



CCG ATA 
Pro lie 



GTC GAG GAA 
Val Glu Glu 
135 



TCT GAT 
Ser Asp 



CGG AAA ^TG 
Arg Lys/ Leu 
140 



TTT CAA GTT 
Phe Gin Val 



431 



TTA GCC GTG 
Leu Ala Val 
145 



CAA GTG 
Gin Val 



ACT CTG TTT 
Thr Leu Phe 
150 



CCC GGT 
Pro Gly 



CGC ^GG GTG 
Arg/eiryvVal 
155 



TGC ATC GGA 
Cys lie Gly 



479 



ATA ACG ACG 
lie Thr Thr 
160 



CAC CAC 
His His 



ACC GTT AGC 
Thr Val Ser 
165 



'GCT ^CA TCCyTTT GTA GGG TTT 
Se/ Phe Val Gly Phe 
175 



527 



ATG AAG AGT 
Met Lys Ser 



TGG GCT 
Trp Ala 
180 



TCC ATC ACT AAA 
Ser lie Thr Lys 



PTC GGA 
the Gly 
185 



/GGA GAT 
Gly Asp 



GAT GAA TTC 
Asp Glu Phe 
190 



575 



TTG GAC GGA 
Leu Asp Gly 



A7^ GGT 
Lys Gly 
195 



GAA TGT TTG 
Glu Cys Leu 



:G GTT 
Val 
200 



TTC GAC CGA 
Phe Asp Arg 



TCG CTC GTG 
Ser Leu Val 
205 



623 



AAT TAT CCG 
Asn Tyr Pro 
210 



CCT AAA 
Pro Lys 



TTG GAC 
Leu Asp ^hr 
;15 



TAT TTA 
Tyr Leu 



TGG AAC AAC 
Trp Asn Asn 
220 



GCG CAG AAA 
Ala Gin Lys 



671 



CGT CCG TTG 
Arg Pro Leu 
225 



GAA TCG 
Glu Ser 



CAG ciyr CCA 

Gin Bfi^s Pro 
30 



TCT TTA 
Ser Leu 



CCG ACG GAT 
Pro Thr Asp 
235 



CGG ATT CGA 
Arg lie Arg 



719 



GCT ACC TAC 
Ala Thr Tyr 
240 



CTT TTC 
Leu Phe 



ACQ CAA TCT 
TY)pc Gin Ser 
2/ 



GAA ATT 
Glu lie 



AAG AAA TTG 
Lys Lys Leu 
250 



AAG GGT TTG 
Lys Gly Leu 
255 



767 



ATT CAG AGA 
lie Gin Arg 



AAA GCC 
Lys Ali 

26, 



/CCA AAT GTA 
Pro Asn Val 



GTT AAT 
Val Asn 
265 



CTC TCT TCC 
Leu Ser Ser 



TTC GTC GCG 
Phe Val Ala 
270 



815 



ATC GCA GCT 
lie Ala Ala 



TAT jyrc 

Tyr ^le 
27 



TGG ACC GGC 
Trp Thr Gly 



ATC GCC 
lie Ala 
280 



AAA TCG GTC 
Lys Ser Val 



GGA GAT TAC 
Gly Asp Tyr 
285 



863 



AAA GAC GTG 
Lys Asp Val 
290 



GAT GAC 
Asp Asp 



GAC AAA CGC 
Asp Lys Arg 
295 



GCT TTC 
Ala Phe 



TTT TTA ATT 
Phe Leu lie 
300 



CCG ATC GAT 
Pro lie Asp 



911 



- 67 - 



TTA AGG CCG CGT TTG GAT CCG CCG GCT CCG GGG AAC TAG TTC GGA/AAC 
Leu Arg Pro Arg Leu Asp Pro Pro Ala Pro Gly Asn Tyr Phe Gly Asn 
305 310 315 

TGT CTA TCG TTT GCG ATG GCG AAG ATC CTG CGG CGG GAT TTG/GTC GGA 
Cys Leu Ser Phe Ala Met Ala Lys lie Leu Arg Arg Asp Lejd Val Gly 
320 325 330 / 335 

GAT GAA GGG GTG TTT CGG GCA GCT GAG GCG ATC GCG GCG/GAA ATA GAG 
Asp Glu Gly Val Phe Arg Ala Ala Glu Ala lie Ala Alaf Glu lie Glu 
340 345 / 350 

AAG AGG ACG AGC GAC AAG AAG ATT CTA GAA ACT GTG yt^AG AAC TGG CCG 
Lys Arg Thr Ser Asp Lys Lys lie Leu Glu Thr VayGlu Asn Trp Pro 
355 360 / 365 

PC TCG GTG GCG GGA 
Ser Val Ala Gly 
,380 



TCT GAG ATT CGC GAA GCC TTG CAA AAC TGT TAT 
Ser Glu lie Arg Glu Ala Leu Gin Asn Cys Tyr 
370 375 



TCG AGC AGG CTT GAT CTT TAC GGC GCG G 
Ser Ser Arg Leu Asp Leu Tyr Gly Ala yAsp 
385 ~ 390 



T'J'T C^QKlllQfQt GGT AAG GCG 
le Gl^ Trp Gly Lys Ala 



GTG AAG CAA GAG ATA CTG TCG ATT G^^T GGA GAG 
Val Lys Gin Glu lie Leu Ser lie Adp GW Gli 
400 405 /41/ 



^G TTT ACG ATG TCG 
'Lys Phe Thr Met Ser 
415 



TTG TGT AAA CCG AGG GAT GCT GCC GGA /GGA 
Leu Cys Lys Pro Arg Asp Ala Ala Gly Gly 
420 / 425/ 



rTG GAG GTT GGA TTG TCT 
jeu Glu Val Gly Leu Ser 
430 



TTG CCA AAG GAG GAA TTG CAA GCT TTT GAT GAT TAT TTT GCG GAG GGA 
Leu Pro Lys Glu Glu Leu Gin Ala /Phe Asp Asp Tyr Phe Ala Glu Gly 
435 / 440 445 

ATA AAG GGT TGATTAATCA TTTAATCT^ITG TATTATGAAG TTGGATGAAA 
lie Lys Gly 
450 

TCCTCTGTTT CATCTCTATT GTTTJ^CTUV TAATTTTTTT CCATTGAACT TTTTTGAGTC 
AATAAAAAAA AAAAAAAAAA AAA^^AAAATG AAAAAACTCA GTTATTTTTT xtTTTTTTTT 



(2) INFORMATION FOR SEJS ID NO : 7 : 

(i) SEQUENCE CH;^tACTERISTICS : 

(A) length/ 10 amino acids 

(B) TYPE: Amino acid 

(C) STRANpEDNESS : single 

(D) TOPOLOGY: linear 




959 

1007 

105 5 

1103 

1151 

1199 

1247 

1295 

1343 

1392 

1452 
1512 
1521 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Arg Phe Leu Gly lie Thr Gly Ser Pro Lys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ 

lie His Met Asp Ala Phe Ala Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino ^cids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : si^igle 

( D ) TOPOLOGY : 1 ineai 

(ii) MOLECULE TYPE: pep^de 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIBTION: SEQ ID NO : 9 : 

Gly Val Glu lie Gly Val Ser Leu Pro Lys 
1 5/ 10 



(2) INFORMATION FOR 



ID NO: 10; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8 amino acids 

(B) TYPEf^: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECUi/e TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Ser Leu Ser Leu Thr Leu Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11; 

His Tyr Val Pro Leu Ser Gly Asn Le^ 
1 5 



lie Lys 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids ^ 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION/ SEQ ID NO: 12: 

Val Arg Ala Thr Tyr Val/ Leu Ser Leu Ala Glu lie Gin Lys 
1 5 / 10 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARAOTERISTICS : 

(A) LENGTH: 8/ amino acids 

(B) TYPE: amino acid 

(C) STRANDEmiESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



- 70 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

lie His Met Asp Ala Phe Ala Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : /4 

Lys lie His Met Asp Ala Phe Ala Ly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS :y 

(A) LENGTH: 8 amino aci/as 

(B) TYPE: amino acid 

(C) STRANDEDNESS: sin§ 

(D) TOPOLOGY: linear. 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Lys lie His Met Asp/ Ala Phe Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) type/ nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPpLOGY: linear 

(ii) MOLECUli: TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQU^CE DESCRIPTION: SEQ ID NO: 16 
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AARATHCAYA TGGAYGCNTT YGC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CTCGAGTTTT tTTTTTTTTT TTT 

(2) INFORMATION FOR SEQ ID N0:18i 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 26 base pai 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singl 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genc 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION;/ SEQ ID NO 
TTCACCATGG AGCAAATCCA AATGGy 

(2) INFORMATION FOR SEQ ID'^N0:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: I y base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYfiE : DNA (genomic) 
(iii) HYPOTHETIC^: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CGAGTCGCCC TCATCA^ 

(2) INFORMATION FOR SEQ ID NO: 20: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AACAGCTATG ACCATG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID yi^O 

Asp Phe Gly Trp Gly Lys / 
1 5 / 



(2) INFORMATION FOR SEQ ID NO: 22:/ 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pAirs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sittigle 

( D ) TOPOLOGY : 1 ineat 

(ii) MOLECULE TYPE: DNA /(genomic) 
(iii) HYPOTHETICAL: NO / 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAYTTYGGNT GGGGNAA / 



(2) INFORMATION FOR SEOf ID NO:23: 

(i) SEQUENCE CHAI^CTERISTICS : 

(A) LENGTH: pi base pairs 

(B) TYPE: niicleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
TGGCAACTGT CTTGCGTCAT G 



21 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CCATGTCAGG TGTGAGGTTC AAC 



23 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base paii 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sing/e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (gfenomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPT/oN: SEQ ID NO: 25; 
ATCGTTTCGC ATGATTGAAC 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CH^^IaCTERISTICS : 

(A) length/ 20 base pairs 

(B) TYPE: Aiucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOItOGY: linear 

(ii) MOLECULE/TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TCAGAAGAAC TCGTCAAGAA 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12 . . 53 

(xi) SEQUENCE DESCRIPTION: SEQ ID^ 

GGGATCCAAC A ATG GAG CAA ATC CAA ATGy 
Met Glu Gin lie Gin Met 
1 5 

AGA 
Arg 



fG ATC GAA ACG TGT 
^al lie Glu Thr Cys 
10 



50 



53 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 baafe pairs 

(B) TYPE: nucleic/acid 

(C) STRTUSTDEDNESS / single 

( D ) TOPOLOGY : 1 ihear 

(ii) MOLECULE TYPE: ©NA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTAAAACGAC GGCCAT 



16 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) sequence/characteristics : 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12.. 45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GGGATCCAAC A ATG GAG CAA ATC CAA ATG GTG AA^ ATT CTC GAA C 
Met Glu Gin lie Gin Met Val Asfri lie Leu Glu 
15 20 / 25 



45 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genoij^ic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: /SEQ ID NO : 3 
CTCGGAGGAA TTCGGCACGA C 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : / 1 inear 

(ii) MOLECULE TYPE/: DNA (genomic) 
(iii) HYPOTHETICAIf: NO 

(ix) FEATURE: 

( A ) NAME /ykE Y : CDS 

(B) LOCATION: 18 . . 35 



i 



